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A FURTHER NOTE ON THE AGE INDEX OF A POPULATION 1 

By Raymond Pearl and T. J. LeBlanc 

School of Hygiene and Public Health, Johns Hopkins 
Communicated September 7, 1922 

In an earlier paper in these Proceedings, Pearl 2 proposed the function 
* = 5 {-f }(M -M P ) 

as a single numerical index of the age distribution of a population. In 
this expression 5 indicates summation, for all age groups given in the origi- 
nal statistics, of the expression in brackets; A is the deviation, for each 
age group, of the percentage of the actual population in each age group 
from the percentage of the same group in the standard population of 
Glovers 8 Life Table, denoted by P; M = mean age of living population ; 
M p = mean age of persons in a stationary population unaffected by 
migration. 

In connection with certain problems now under investigation in this 
laboratory it becomes a matter of importance to know how sensitive this 
age index is to change in the size of the age groups of the original sta- 
tistics. Since its original proposal the index has been used in many stud- 
ies in this laboratory, and the larger experience has strengthened our 
confidence in its reliability as an index of significant variations in the age 
constitutions of populations. But it has always been used hitherto with 
at least 6 to 8 age groups covering the life span. Suppose the original 
statistics furnish only 3 age classes for the entire life span. Will this age 
index <j> then give a reliable picture of the significant variations in age dis- 
tribution, as we pass from city to city, or county to county? 

To test this point, the obvious thing to do is to determine the correlation 
between the age indices for n communities, on the basis of say 3 divisions 
of the life span, with the age indices for the same communities on the basis 
of say 6 divisions of the life span. If the correlation is high it will mean 
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that even with the coarse grouping the index is reflecting the main dif- 
ferences in the form of the age distributions. 

The present note reports a test of this sort. The 1915 census of Iowa 4 
gives in Table I the age distribution of the inhabitants of each of the 99 
counties of the state in such form as to permit the formation of the follow- 
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FIGURE 1 

Spot diagram showing the correlation between <fo and <£j. Each dot represents one 
county. 

ing classes: 0-4, 5-9, 10-17, 18-20, 21-44, 45 and over, both figures being 
inclusive in each case. For each of the 99 counties was computed on 
the basis of this grouping of the ages. Call these values 4>i- Then for 
the same 99 counties the population was combined by addition into the 3 
age groups 0-9, 10-20 and 21 and over. A new set of <t>'s was then calcu- 
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lated on the basis of this coarse grouping. Call these values fa. Then 
what we wish to know is the value of the correlation r^,. 

This was calculated by the direct product moment method, without 
grouping and gave as a result 



'*102 



= + .84 ± .02. 



It is at once evident that we are dealing here with a high degree of 
correlation. That the regression is linear is indicated by the spot dia- 
gram of the correlation surface shown in figure 1. 

This result indicates that whenever it is impossible, by reason of un- 
satisfactory tabulation of the original raw statistics, to get any but a 
coarse age grouping, we may still use the age index <f> with confidence that 
it will reflect closely the differences in population in respect of age distri- 
bution. The test here given is obviously a severe one, because all adult 
ages are thrown into one group. But even so, the correlation is high. 
With such a value of r as this, it is obvious that one could, by means of the 
appropriate regression equation, determine corrections for the fa values, 
giving predicted or calculated fa's which would with extreme accuracy 
approximate the true values. 

The means and standard deviations are as follows : 

Mean fa = 27.27 =■= 1.46 

Standard deviation fa = 21.53 ± 1.03 

Mean fa = 19.71 ± 1.23 

Standard deviation fa = 18.14 ± .87 

As would be expected the fa mean is lower than the fa mean, and fa is 
less variable, as indicated by the standard deviation, than fa. The 
difference in the means is large, but this is of no great importance in the 
practical use of the age index. It is of much greater practical significance 
that the difference in the variabilities is not large. If we could trust here 
the theory of simple sampling (which we probably cannot) the difference 
between the standard deviations could not be regarded as certainly signifi- 
cant at all. The more important consideration is that fa is reflecting in 
its standard deviation 84.2 per cent of the variation among these 99 Iowa 
counties in respect of age distribution of their populations, as measured 
by fa. 

In conclusion it may be stated that since, as is demonstrated in this 
paper, there is a high correlation between the values of <j> for extremely 
coarse age groupings, with the values obtained from finer groupings, this 
age index may be used with considerable confidence in cases where data on 
age distribution are available only for a few broad classes covering the life 
span. All our experience with this age index <£ since its publication two years 
ago has confirmed our belief that it is a function of great value to the vital 



Vol. 8, 1922 CHEMISTRY: NOYES AND WILSON 303 

statistician who wishes to make analytical studies of medical and hygienic 
data by modern statistical methods. 

1 Papers from the Department of Biometry and Vital Statistics, School of Hygiene and 
Public Health, Johns Hopkins University, No. 69. 

2 Pearl, R. Proc. Nat. Acad. Sci., 6, pp. 427-431, 1920. 

3 Glover, J. W. United States Life Tables, igio. Bureau of the Census, 1916. 

4 Census of Iowa, 1915. Published by the Executive Council. 



THE THERMAL IONIZA TION OF GASEOUS ELEMENTS A T HIGH 

TEMPERATURES 

A CONFIRMATION OF THE SAHA THEORY 

By Arthur A. Noyes and H. A. Wilson 

Gates Chemical Laboratory, California Institute of Technology, and Physics 
Laboratory op the Rice Institute 

Read before the Academy, April 25, 1922 

In a recent article Eggert 1 has shown that the ordinary thermodynamic 
expression for the effect of temperature on the equilibrium of chemical re- 
actions, when supplemented by assumptions as to the values of the special 
constants occurring in it, can be applied to the calculation of the extent 
to which under different conditions neutral atoms of the gaseous elements 
are converted into positive ions and free electrons. Saha 2 has shown 
that the most uncertain factor in Eggert' s formulation can be eliminated 
by introducing the ionization-potential of the element. He has then 
calculated the dissociation of the first electron from the atoms of many 
important elements, thus, the extent to which such reactions as Na = Na + 
+ E~ take place, where E~ represents electron gas. He has tabulated the 
values of this percentage ionization at various temperatures and pressures; 
and he has especially pointed out the great significance which these con- 
siderations may have for the interpretation of the spectra of elements 
under solar and stellar conditions. 

These principles are of great interest to astrophysicists, as may be 
illustrated by the following applications. It has long been known that 
the solar prominences show strongly the hydrogen and helium lines and 
the enhanced (H and K) lines of calcium, but not the blue line of calcium 
nor the familiar lines of sodium or of other alkali elements. The re- 
markable absence of these lines had not been explained; but the thermo- 
dynamic relations now show that the extremely small pressure combined 
with the high temperature of the prominences must greatly promote the 



